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The minimum probability of error (MPE) measurement discriminates between a set of candidate 
quantum states with the minimum average error probability allowed by quantum mechanics. Con¬ 
ditions for a measurement to be MPE were derived by Yuen, Kennedy and Lax (YKL) [I]. MPE 
measurements have been found for states that form a single orbit under a group action, i.e., there is 
a transitive group action on the states in the set. For such state sets, termed geometrically uniform 
(GU) in [2], it was shown that the ‘pretty good measurement’ (PGM) attains the MPE. Even so, 
evaluating the actual probability of error (and other performance metrics) attained by the PGM on 
a GU set involves inverting large matrices, and is not easy in general. Our first contribution is a 
formula for the MPE and conditional probabilities of GU sets, using group representation theory. 

Next, we consider sets of pure states that have multiple orbits under the group action. Such states 
are termed compound geometrically uniform (CGU). MPE measurements for general CGU sets are 
not known. In this paper, we show how our representation-theoretic description of optimal meas¬ 
urements for GU sets naturally generalizes to the CGU case. We show how to compute the MPE 
measurement for GGU sets by reducing the problem to solving a few simultaneous equations. The 
number of equations depends on the sizes of the multiplicity space of irreducible representations. For 
many common group representations (such as those of several practical good linear codes), this is 
much more tractable than solving large semi-definite programs—which is what is needed to solve the 
YKL conditions numerically for arbitrary state sets. We show how to evaluate MPE measurements 
for GGU states for some examples relevant to quantum-limited classical optical communication. 


I. INTRODUCTION 

Optimal discrimination of quantum states is central to 
a large number of key problems in quantum information 
theory. Quantum state discrimination finds applications 
in: (i) computation —for instance, in quantum algorithms 
for hidden subgroup problems Hi]; ( ii) sensing —for 
instance, in task-specific optical imaging j5], quantum 
reading [6], and pixelated image discrimination m ; (iii) 
communication —for instance, in decoding error correct¬ 
ing codes for classical communication over a quantum 
optical channel [SHlI] and optimal M-aiy phase discrim¬ 
ination under a photon budget constraint [12) . The 
problem of describing a quantum measurement to op¬ 
timally discriminate between a set of quantum states, 
i.e., to optimize a given metric averaged over the states 
and the transition probabilities induced by the meas¬ 
urement, was first considered by Yuen, Kennedy and 
Lax ^ , who showed that the optimal measurement is one 
whose measurement operators satisfy a particular semi- 
definite program, which is described later in this paper 
(we use optima/measurement, minimum probability of er¬ 
ror (MPE) measurement and Yuen-Kennedy-Lax (YKL) 
measurement interchangeably). For arbitrary states how¬ 
ever, finding the solutions of the YKL semi-definite pro¬ 
gram can be computationally hard. However, |13j have 
shown that a certain measurement called the pretty good 
measurement (PGM) comes close to the optimal meas¬ 
urement for arbitrary states. Moreover, upper and lower 
bounds on the MPE have been recently obtained for the 
general problem [14]. Restricting to pure states with a 
group symmetry makes the problem of finding the exact 
MPE more tractable. Helstrom considered the problem 


of finding the optimal measurement and the exact MPE 
for states that have cyclic group symmetry [15] . This 
was extended to arbitrary abelian groups with a transit¬ 
ive action by Forney and Eldar m- In fact, they showed 
that the pretty good measurement (PGM), also called the 
least squares measurement (LSM), as defined in [T7] and 
[18] . is optimal in this case. Later, for any non-abelian 
group with a transitive action on the states, Eldar et 
ah, [T9| showed that the PGM was again optimal. How¬ 
ever, an expression for the MPE was not known. In this 
paper, we fill this gap using group representation the¬ 
ory. In |2Q], the YKL conditions for the optimal meas¬ 
urement was were written in terms of equalities (rather 
than involving an inequality, which makes it harder to 
check). Recently, in [^, a generalization of the optimal 
measurement to non-projective measurements has been 
considered. 

When the states are compound geometrically uniform 
(CGU), i.e., they have multiple orbits under the group 
action (restricting to group actions that are permutation 
representations), very little is known about the struc¬ 
ture of the optimal measurement. Solving the general 
CGU state discrimination problem is particularly use¬ 
ful for designing the optimal decoder of linear codes for 
classical quantum (cq) channels, i.e., for sending clas¬ 
sical data over a quantum (such as, an optical) channel. 
It was recently shown that a generalization of Arikan’s 
polar codes [22] can achieve the Holevo capacity of any 
cq channel m- 

Any linear code has an automorphism group and the 
action of this group on the code is a permutation action 
and hence is a CGU set. This group action carries over, 
in general, to modulated code words. For example, for 
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binary codes over a binary-phase-shift coherent-state al¬ 
phabet (Ia), I — a)), the bit flip operation maps to the 
TT phase-shift operation in the modulated domain, i.e., 
UttI i ct) = I T a), where Therefore, find¬ 

ing the optimal decoder of a set of pure states with a 
CGU action will not only enable finding the best per¬ 
formance of any coherent-state-modulated linear code, 
but also lend useful insight towards designing structured 
optical receivers to realize the optimal measurement. In 
[5], Forney gives several examples of codes whose auto¬ 
morphism groups have a GU action on the code space. 


In this paper, we consider both the GU and the GGU 
actions on sets of linearly-independent pure states. In 
the case of GU action, we show how one can use repres¬ 
entation theory to explicitly calculate the probability of 
error and the conditional probabilities. Then we consider 
the GGU action and show that one can reduce the prob¬ 
lem to a set of simultaneous equations. The number of 
these equations depends on the sizes of the multiplicity 
spaces of the representation and the number of orbits. 
We will present examples to show the usefulness of this 
method when the representations have small multiplicit¬ 
ies and few orbits. In particular, we will show examples 
of families of codes of length N and rate R (i.e., number 
of codewords to discriminate, n = 2^^), such that the 
YKL conditions give rise to simultaneous equations, 
whereas our method requires solving a small constant 
number of simultaneous equations. This number comes 
from the dimension of the multiplicity spaces as we ex¬ 
plain later and is independent of N. 


This paper is organized as follows. In Section |TTj we 
describe the problem of discriminating between quantum 
states when the states are pure and are linearly independ¬ 
ent. We describe the gram matrix approach to comput¬ 
ing the YKL measurement given in Helstrom m along 
with a caveat about this approach. In Section m we 
present some examples of brute-force calculations of the 
MPE measurement for examples relevant to optical com¬ 
munication, and demonstrate why this technique is not 
scalable. In Section EYi we describe the optimal meas¬ 
urement for GU states. This description generalizes the 
ones in mnn] for abelian groups to non-abelian groups 
using group representation theory. Then in SectionjVj we 
generalize the results from the previous section, and de¬ 
scribe how one can obtain the optimal measurement for 
GGU states. We show how to reduce the number of sim¬ 
ultaneous equations based on the representation of the 
group, from the full number of equations as specified by 
the YKL conditions. Then we give examples to illustrate 
this method, and its usefulness in the context of optical 
communication. Finally, in Section [VI[ we conclude the 
paper with a summary and open questions. 


II. OPTIMAL MEASUREMENTS FOR PURE 
STATES 

In this section, we describe two approaches to finding 
the optimal measurement. Suppose we are given an en¬ 
semble {pi,\ipi)} ,1 < i < n, oi n linearly-independent 
pure states and an associated prior distribution. It can 
be shown that when distinguishing pure states, the op¬ 
timal measurement is an n-element projective measure¬ 
ment, and is unique m- Therefore, let us assume that 
the optimal measurement is given by the orthonormal 
basis {|rci), 1 < * < n}. Now define two matrices: the 
matrix M whose columns are the un-normalized pure- 
state vectors and the matrix X whose elements 

are Xij = .,Jpj{wi\4’j)■ Since the states are linearly inde¬ 
pendent, each state lies in an n dimensional Hilbert space 
and M is an invertible n x n matrix. The matrix X de¬ 
notes the solution to the state discrimination problem 
since all the information about the measurement vectors 
can be obtained from X. Yuen, Kennedy and Lax showed 
that X must satisfy the equations [T]: 

Y^Y = r, and (1) 


where F is the Gram matrix of the set of states, i.e., 
(r)ij = {ipiltpj). In Helstrom’s book [15], it is suggested 
that these equations lead to the solution. However, we 
would like to emphasize here that these two equations 
alone do not give a unique solution. In it was shown 
that along with the above two equations, an inequality 
must be satisfied. Only when the inequality is considered, 
one gets a unique solution in general. However, in certain 
cases of interest, one can get a small set of solutions using 
the above two equations as we show later. 

It is useful to view this in terms of the polar decom¬ 
position. The left and right polar decomposition of the 
matrix M is given by, 

M = Uy/WM = y/MMW. (3) 

In the above equation, is just the Gram matrix F 

of the set of states {|^i),..., |V^„)} and U is a unitary 
matrix. Denote VMfM by P. It is known that if M is 
invertible, then P and U are unique, with P being a pos¬ 
itive semi-definite matrix. Glearly, P satisfies P^P = T 
as does any matrix of the form UP, where V is unitary. 
Since VP always satisfies Eq. Q for any unitary U, it 
is chosen so that VP satisfies Eq. ([^ as well. Therefore, 
the matrix X is in general of the form VP and the meas¬ 
urement vectors [tCi) are columns of the matrix UUG 
Einally, note that if the solution X turns out to be such 
that PkXkk = PmX mrm then Eq. (|2j) becomes x^m — 
for all k and m, i.e., X is Hermitian. Below, we will show 
that the above condition is satisfied when {|'0i)} is a GU 
set. 





3 


III. BRUTE FORCE CALCULATION OF THE 
MPE MEASUREMENT 

Helstrom calculated the optimal measurement for sev¬ 
eral simple examples |15j . by assuming that any sym¬ 
metry in r is carried over unaltered, to X. For example, 
he considered the equiprobable ternary (n = 3) coher¬ 
ent state set {| — a), |0), |a)}, a £ R. Since the in¬ 
ner products, (±a|0) = (0| ± a) = = k, and 

(±a|=Fa) = = k‘^, this ensemble has an ‘isosceles’ 

geometry. He argues therefore, 

( 1 K K \ / a d d\ 

K 1 K^\^X=ibce\. ( 4 ) 
y b e c J 

With this assumption of symmetry for X, the = 9 sim¬ 
ultaneous equations resulting from the YKL equality con¬ 
ditions Eqs. Q and (|^ reduce to 5 simultaneous equa¬ 
tions in a, b, c, d, e, which Helstrom solved (four variables 
were eliminated analytically, and the last one was solved 
for numerically using the Newton method) to obtain (all 
entries of) X solely in terms of cr, and hence evaluated 
the average probability of error, P^ = 1 — (|ap -|-2|cp)/3. 

A rate R code C of length N codewords over a binary 
phase-shift keying (BPSK) alphabet {|a),| —a)}, has 
n = 2^^ codewords. Each row of the n x n Gram mat¬ 
rix of the codebook, F is a permutation of the first row, 
if C is a linear code. Given this symmetry, the sim¬ 
ultaneous equations of the YKL conditions reduce to n 
equations, since each row of X must also be the same 
permutation as the corresponding row of F. One might 
assert—based on Helstrom’s argument on the symmetry 
in F carrying over to X —that the number of distinct 
entries 0 < d < n in each row of X, would be the same as 
the number of distinct Hamming weights (d) in the code. 
Note that ((r))^^ = with tr = (a| — a) = 

where we define the elements dij of the code’s ‘distance 
matrix’ D to be the Hamming weight between the 
and the codewords. The aforesaid assertion was used 
in Refs. png to calculate the MPE measurements for 
some simple BPSK codes, including the first order binary 
Reed Muller RM(1 ,to) codes. 

Upon some numerical investigation, we found the 
aforesaid assertion to be false. We found examples of 
subcodes of the first order binary Reed Muller code, for 
which identical entries in one row of F resulted in dis¬ 
tinct entries in the corresponding row of X (see Fig. [^. 
This breaking of symmetry from F —> X led us to look 
into multiple orbits, and develop the mathematics to rig¬ 
orously understand the optimal measurements for CGU 
sets, which encapsulate all linear codes. This in turn 
led us to generalize and simplify the previous results for 
GU sets as well, which are presented in Section |rV| Our 
general results on MPE measurements for GGU sets are 
presented in Section |Vj 


■ 0 

4 

4 

4 

2 

4 

2 

4 ' 


a 

h 

c 

b 

d 

b 

d 

b ' 

4 

0 

4 

4 

4 

2 

4 

2 


b 

a 

b 

c 

b 

d 

b 

d 

4 

4 

0 

4 

2 

4 

2 

4 


c 

h 

a 

b 

d 

b 

d 

b 

4 

4 

4 

0 

4 

2 

4 

2 

^ X = 

b 

c 

b 

a 

b 

d 

b 

d 

2 

4 

2 

4 

0 

4 

4 

4 

d 

h 

d 

b 

a 

b 

c 

b 

4 

2 

4 

2 

4 

0 

4 

4 


b 

d 

b 

d 

b 

a 

b 

c 

2 

4 

2 

4 

4 

4 

0 

4 


d 

b 

d 

b 

c 

b 

a 

b 

4 

2 

4 

2 

4 

4 

4 

0 


b 

d 

b 

d 

b 

c 

b 

a 


Figure 1. Solving the YKL conditions numerically for the 
Gram matrix of a [8, 3, 2] sub code C of the BPSK (|a), | —a)) 
Reed Muller (r = 2, m = 3) [8, 7, 2] code, results in the 
number of distinct elements in a row of X to be one more than 
the number of distinct Hamming distances in C. Elements of 
the Gram matrix P, "/ij = , where ct = (q:| — a) = , 

and dij are the elements of the distance matrix D of the code. 
The 4’s in boldface red font in each row of D results in an 
entry at the corresponding position in the row in X, that is 
distinct from the entries in the row of X corresponding to 
the other 4’s in the row of D. For \a\‘^ = 0.01, solving this 
example numerically yields a = 0.54, b = 0.294, c = 0.263, 
and d = 0.382. 


IV. GEOMETRICALLY UNIFORM STATES 

AND THE PRETTY GOOD MEASUREMENT 

We say that a set of states is geometrically uniform 
(GU) if there is a group G acting transitively on them 
i.e., for every two states |'0i) and I'i/’j) there exists a group 
element g £ G such that i?(g)|'0i) = where R is 
some representation of the group G. This implies that 
all the elements of the set are obtained from a single ele¬ 
ment, say by the action of the group. If the states 
are linearly independent, then the representation of the 
group on the space spanned by the states is the induced 
representation of the trivial representation of the stabil¬ 
izer subgroup of IV’i)- For a state discrimination problem 
to be GU, one usually assumes that the priors associated 
with the states in the GU set are the same for all states. 
The PGM was proved to be optimal for cyclic groups 
in [15j . for abelian groups in m and for non-abelian 
groups in m- The pretty good measurement has been 
defined in m and m as a measurement to discriminate 
between the states pi with priors pi. The measurement 
operators of the PGM are given by H^ = pip~^/'^, 
where p = If iFe states are pure {pi = \ipi){ipi\) 

and linearly independent, this measurement becomes a 
projective measurement. Consider the polar decomposi¬ 
tion of the matrix M (defined in Eq. (§)• Observe that 
p = MM^ and so U = p~^/'^M. The columns of U form 
the measurement basis of the PGM. From the left po¬ 
lar decomposition, notice that the columns of U are also 
the measurement basis if the solution matrix X coincides 
with P = \/MtM. Since P is Hermitian, it would be the 
solution of Eqs. and if, in addition, all the diagonal 
elements of P are equal (since the priors are equal). To 
see that this is true for geometrically uniform states, ob¬ 
serve that p commutes with the representation R and 
Xkk = {wkltpk) = {wi\R{g)~'^R{g)\ipi) = (wi|^/’i) = xn. 

Now we describe the measurement using non-abelian 
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group representation theory along the lines of [m US] 
where it was done for abelian groups. In accordance with 
the action of the group, we have \ipi) = U{gi)\il)i) for any 
i. We assume that the priors pi are all the same. Any 
transitive permutation action on a linearly independent 
set is an induced representation. We pick a base point, 
say I'i/'i) and with respect to this point, there is a sub¬ 
group Go of G which stabilizes |'0i). The representation 
on the vector space spanned by \ipi) is the induced rep¬ 
resentation of the trivial representation of Go to G. If 
the set of states is S, then we have that l^l = |G|/|Go|. 

The Yuen, Kennedy, Lax conditions are 

T - Pi'i/’i > 0> 

(T = 0, and 

T = ^PiV'illi = ^Pilli'i/'i, (5) 


where ipi = |^/’i)('0i|. Since the optimal measurement 
basis for a GU set is also GU, it is easy to see that we 
only need the equations where in the first two i = 1. 

Let the optimal measurement basis be given by {|rwi)} 
which are also GU under the G action (and let 11^ = 
I Wi) (iCi I). Therefore we have 


Using the orthogonality relations among matrix entries 
of irreducible representations, we obtain 

(A, m, fc|T|A', m!, k') = y^(A, m\ 1). 

i 

One can check that T is block diagonal with the blocks 
given by multiplicity spaces. Let (j) = (1/|5'|)V’- Now 
using the YKL equations, inside these invariant spaces, 
we see that 


(A, m, fc|(Tn — 0 n)|A', to', k') = 0 

= ,m',k' ~ ^ X,m,k^\\m',k' =0) (8) 

where Xx^rn,k = (A,to, fc|0) and 7 A,m.fe = {X,m,k\w). In 
order to find the optimal measurement, we need to solve 
for 7 . Suppose that 7 A.m.fe = a;A.m.fc/cA,m, where we 
need to solve for c\,m- We have from Eq. [^that TA,m = 
Cx,m{4‘\w), if 7A.m.fe 7 ^ 0. We have that 


Ta. 



Xla;A.m,fe7Xm.fc(</>h) 

k 


l-gl Efc |a^A,m,fcP 

^A CA,m 




(9) 


' ' geG 

Now let this representation consist of irreducible repres¬ 
entations A with multiplicity mx- Consider the Fourier 
basis |A,to, fc) where A labels the irreducible represent¬ 
ation, TO its multiplicity and k its representation space 
whose dimension is denoted dx- The matrices U{g) are 
block diagonal in this basis and therefore the operator 
T is also block diagonal by Schur’s lemma. In order to 
find the probability of error, we need to access to an ar¬ 
bitrary matrix element of T inside the blocks. Following 
Helstrom |15j (who worked this out for cyclic groups), we 
have 


(A, TO, fc|T|A', to', k') = 

-^ '^{X,m,k\U{g)tjjiUiU{g~^)\X',m',k'). ( 6 ) 

' ' g 

We denote V'l and Hi as V’ and 11 respectively. The action 
of any U{g) on the state |A,to, fc) is given as follows 

U{g)\X,m,k) = '^X{g)k',k\X,m,k '), 

k' 


where X{g)k',k is the fc', k matrix entry of the irreducible 
representation A. Using this we get 

(A, TO, fc|T|A', to', fc') = 


Using the above two equations for Tx,m we get that 
l>S'IEfcl^A,m,fcP ^ ^ 

dx(cx,mr 

The solution of the above equation is 


CA,m — 


Efc l‘5'||a:A.m,fe| 


We also need to check that T — </) > 0 for this solution. In 
order to do this, let \fi) be an arbitrary normalized state. 
Then the above equation becomes {p\{T — (j))\p) > 0. The 
left hand side can be written in the Fourier basis as 


E X,m 


X.m.k 


E ,m,kf^X,m,k 


X.m.k 


= E 

A,m,fe 


A,m,fc| CA,m 


E 

A' ,m' ,k' 


\^X' ^m' ,k' 


CX' 


E! ^A,m ,kf^X,m,k 

X,m,k 


( 10 ) 


Now consider the second half of the above expression 



2 

2 

®A,m,fcP-A,m,fc 

= 

'y ^ \/CA,m7A,m,fc\/CA,mP-A,m,fc 

X,m,k 


X,m,k 


< E ia^a.^aEa.™ E (II) 

A,m,/c X',m' ,k' ^ 
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where the last line was obtained through Cauchy- 
Schwartz. This shows that the 7 are the solutions. We 
can assume that the basis of the multiplicity space is 
picked in such a way that x\^rn,k is non-zero for only one 
TO. Then ca,™ = /d\ and 


\w) 


E 

X,m,k 


|A, TO, k) 


^X,7n 


I d\ P\\'4’) 

A V VWdXW) ’ 


where P\ is the projector onto the isotypic space A. 

We now calculate the probability of success using this 
expression. The probability of success is given by Pg = 
|(w|'0)p. This can be written as 


Pg = 


E \J {'^\Px\i^) 


For any group with a representation U, an expression for 
P\ is given by (for a character x\) 


One can simplify the expression {iplPxlip) as follows. 

i^lPxW = 1 ^ E Xx{9~^){^\Uig)\^). ( 12 ) 

' ' geo 


But we have {iplU{g)\ilj) = (V’|tA( 5 iS 52 )|^) for all 51,52 G 
Go, where Gq is the stabilizer group of IV’) i.e., the sub¬ 
group of G such that U{g)\ijj) = \ip), V 5 G Gq. This 
means 

^ E Xx{g~'^){i^\U{g)\i;), (13) 

' ' i.geCi 


where z is a sum over (Gq, Gq) double coset representat¬ 
ives and Ci is the double coset. This sum can be further 
simplified to 


i^lPxW = xxig ^), (14) 

I I I geCi 

where gi is the double coset representative of the double 
coset Ci- This can further be written as 


where xa(G,) = {l/\Go\)Y.geC,Xx{g ^)- This gives 
an explicit formula for the probability of success. This 
means that if we can find the sum of the character val¬ 
ues of the elements of a double coset easily, then we can 
obtain the formula for the probability of success. In the 
next subsection, we show how to do this for a specific 
case. 


A. GU example from optical communication 


Let us first consider the example of the iV-ary optical 
pulse position modulation (PPM), which has N code¬ 
words each consisting of N modes, only one of which 
is excited in a coherent-state pulse |a), a G M, where 
n = |ap is the mean photon number of the pulse. Each 
row of the Gram matrix has two distinct entries: (a) one 
diagonal entry, = 1, and (b) N — 1 entries cor¬ 

responding to distinct codewords, {ipi\il)j) = where 
K = (0|a) = The PPM state set is clearly GU 

under cyclic group action, for which Helstrom’s calcula¬ 
tion of the error probability m can be applied to obtain 


pMPE ^ ^1 + (]V_1)«2 _ 7137(2 


This was 


also independently obtained earlier by Liu in [24j . 

The standard receivers employed in optical communic¬ 
ation are homodyne detection, heterodyne detection, and 
direct detection. The optimal standard receiver meas¬ 
urement for demodulating PPM is direct detection. The 
quantum-noise-limited direct detection measurement is 
realized by an ideal photon-number resolving (PNR) de¬ 
tector, which is a projective measurement on the photon 
number basis {|0), |1),...}. In this basis, a coherent state 
la) = ^1^)- ideal PNR detector will 

successfully discriminate the PPM codewords if either (a) 
the pulse position in the codeword successfully generates 
a ‘click’ (which happens with probability 1 — e“"), or (b) 
if the pulse fails to generate a click but the receiver still 
chooses the correct codeword purely by chance. Assum¬ 
ing all the codewords are equally likely, the average prob¬ 
ability of error, _ N^^-n ^ Dolinar showed that in 

the ‘high photon number’ regime {Ne~'^ 3 1 to be pre¬ 
cise), ^ e“", whereas P^pe ^ [25]. Thus, 

the MPE has a factor of two higher error exponent com¬ 
pared with quantum-noise-limited direct detection, in the 
high-photon number limit. Even though the design of a 
structured optical receiver that can exactly attain P^^^ 
at any value of h remains unknown, there are receiver 
structured known—the conditional pulse nulling (CPN) 
receiver for discriminating PPM codewords |23[2S|) and 
the sequential waveform nulling (SWN) receiver which 
works for discriminating any N coherent state code¬ 
words [26] —both of which can attain the optimal error 
exponent (i.e., that of the MPE measurement) in the high 
photon number limit. 

Let us recall that the PGM was proved to be the op¬ 
timal (MPE) measurement for: (a) cyclic groups in [15] . 
for (b) abelian groups in |16] . and for (c) non-abelian 
groups in [111 . For the cases dealt in (b) and (c), no 
systematic method to calculate the minimum error prob¬ 
ability was given. Hence, we consider next an example 
from optical communication that truly demonstrates the 
power of our method since the group involved is non- 
abelian, and hence no systematic method to calculate the 
MPE (other than by brute-force evaluation) is known. 
Two-pulse PPM is a modulation constellation contain¬ 
ing (^) codewords. Each codeword consists of N modes. 
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whose state is a tensor product of coherent state |a) in 
two of the N modes, and vacuum (|0)) in the remain¬ 
ing N — 2 modes. Each row of the Gram matrix there¬ 
fore has three distinct entries: (a) one diagonal entry, 
= Ij (b) {^ 2 '^) entries corresponding to code¬ 
word pairs both of whose pulses are in non-overlapping 
modes, i.e., (V'ilV’i) = where k = (Ojo;) = 
and finally, (c) remaining entries corresponding to code¬ 
word pairs one of whose pulses are in the same mode, 
i.e., ii’ili’j) = 

We assume some familiarity with the representation 
theory of the symmetric group. The symmetric group Sn 
acts on this set of states and in this representation oi 
there are three irreducible representations. The trivial, 
standard and the a third irrep whose Young diagram has 
two rows with two boxes in the second row (label them 
a, 6, c respectively). In this case, Gg is S 2 x Sm -2 and the 
double coset representatives are e, (1,3) and (1,3)(2,4) 
(label these double cosets Gq, Gi and G2 respectively). 
The double coset sums turn out to be 

fN — 2\ 

Xa(Go) = l,Xa(Gi) = 2(iV-2),Xa(G2)= 2 j’ 

Xb(Co) = 1, Xb(Ci) = iV - 4, Xb{C2) = -{N-3), 

Xc(Go) = 1, Xc(Gi) = -2 , Xc(G2 ) = 1. (16) 

where Xxi^i) = (l/IGol) EgeC; Xa(5 "^)- With this the 
probability of success can then be easily calculated to be 


?MPE 


\/l + Xa(Gi)«;2 + Xa(G2)s;'‘ 


" N{N-l)[' 

+ {N- l)V^ + Xb{Ci)b^^+XbiC 2 )K^ 
"iVN 




-N]^/lVxJC^W+^aC^ 


(17) 


When K = 0, it can be seen that Pg = 1. This corres¬ 
ponds to the states being orthogonal to each other and 
the PGM is just the measurement in that basis (and it 
always succeeds). 

An ideal PNR detector will successfully discriminate 
the codewords if either (a) both pulse positions gener¬ 
ate a ‘click’ (each of which happens with probability 
1 — 1(01 Of) p = 1 — K^), (b) only one of the pulses gen¬ 
erate a click and a random guess among the — 1 re¬ 
maining pulse positions yields the second pulse position 
correctly by chance, or (c) neither of the pulses generate 
clicks but a random guess among all (^) codewords yields 
the correct choose by chance. The success probability is 
therefore given by. 


pPNR 

c 



2 (1 — 

A^- 1 



(18) 


The probabilities of error = 1 — P^p®, and 

pPNR _ ^ _ pPNR plotted as a function of the 
mean photon number n = |ap in Fig. Interestingly, 
numerical evaluation of these error probabilities for for 



Figure 2. (Color online) The probability of error of dis¬ 
criminating (^) two-pulse-PPM coherent-state codewords, 
for A = 8, plotted as a function of the mean photon num¬ 
ber h = |a|^. The blue (dashed) plot corresponds to the 
error probability achievable by a quantum-noise-limited PNR 
measurement, whereas the red (solid) plot is the minimum 
probability of error (MPE) achievable by the optimal meas¬ 
urement allowable by quantum mechanics. 


two-pulse PPM show that, just like PPM, in the high 
photon number limit (Afe“" <C 1), ~ e”", whereas 

pMPE c —2n 

c . 

It is worth noting that the SWN receiver [5H]— which 
can be built in principle using simple linear-optic com¬ 
ponents and single photon detectors—can attain the op¬ 
timal (MPE measurement’s) error exponent in the high h 
limit for discriminating any N coherent state waveforms. 
Hence for two-pulse PPM, pSWN ^ g-2n^ optical re¬ 
ceiver structure to exactly attain P^pe finite n is 

significantly more complicated, and requires truly non- 
classical (entangling) operations within the receiver |27) . 


V. OPTIMAL MEASUREMENT FOR CGU SETS 

In this section, we describe how to obtain the optimal 
measurement for CGU state sets. We use the Helstrom 
description of the problem of finding the optimal meas¬ 
urement for pure states i.e., by viewing it as the solu¬ 
tion of a set of simultaneous equations. However, as we 
pointed out earlier, one need not obtain a unique solu¬ 
tion. With every obtained solution, we have to check the 
third condition to find the right one. We again resort 
to representation theory to simplify the equations and 
obtain far fewer equations (in many practical cases of in¬ 
terest). We begin by recalling the set of simultaneous 
equations which give the solution given above in Eqs. 
and WlX = P, XkmXtim = Xkkx’^mk where P is the 
Gram matrix of the set of states. Since we have CGU 
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Figure 3. (Color online) The probability of error of discrim¬ 
inating 2N binary-phase-coded PPM codewords, for N = 8, 
plotted as a function of the mean photon number h = \a\'^■ 
The magenta line (circles) correspond to the error probabil¬ 
ity achievable by an ideal homodyne detection measurement, 
the blue (dashed) plot is the error probability achievable by 
a non-standard yet structured receiver described in the text, 
and the red (solid) plot is the minimum probability of error 
(MPE) achievable by the optimal measurement allowable by 
quantum mechanics. 

symmetry in the problem, the Gram matrix is symmet¬ 
ric about a group G and its representation U(g). Suppose 
this representation decomposes into irreducible spaces A 
of dimension d\ with multiplicity m\ (as before). Then 
the Gram matrix is block diagonal in this basis with a 
block corresponding to each irreducible representation. 
Denote these blocks as . These blocks also have a spe¬ 
cial structure where they are identity in the representa¬ 
tion space i.e., = T;, GWe note that the solution 

X would have the same block diagonal decomposition 
since it commutes with the same representation of G. 
Then we have that inside an isotopic space Xt^Xx = T^. 
Notice that these matrices are of dimension mx x mx- 
Therefore, if the multiplicity spaces are small, then this 
task is easy. Now, solving for such an Xx can be done 
only up to a unitary since for any solution Xx, UxXx is 
also a solution, where Ux is an arbitrary unitary oper¬ 
ator. In order to find a solution, we need to use the set 
of equations in Eq. 

A. CGU example from optical communication 

CGU state sets are of particular importance in optical 
communication. This is because all linear codes have 
GGU symmetry, and explicit linear codes (viz., quantum 
polar codes) are known to achieve the quantum (Holevo) 
limit of the classical communication capacity over any 


quantum channel cni, including the lossy-noisy bosonic 
channel [ 28 ]. We now illustrate our method explained 
above with a non-trivial CGU example relevant to optical 
communication. Consider a modulation code comprising 
n = 2 N codewords, where each codeword is an A^-mode 
pure states. Further, N of the 2 N codewords comprise a 
PPM set with a coherent state |a) in the respective pulse 
positions, while the remaining N codewords comprise a 
PPM set with a coherent state |/ 3 ) in the respective pulse 
positions. This set clearly is CGU under the cyclic group 
action. Therefore, in the Fourier basis, the Gram matrix 
has N 2 X 2 blocks. The first block is 

/ 1 -b (A - l)e-l“l' C \ 

V C 1 -b (TV - l)e-I^IV ’ 

where C = exp(a/ 3 * — ±J^) -|-(A^ — 1 ) exp( ^°^ ). 

The other blocks are 

/l - A(A- l)e-l“l" D \ 

V D 1 -TV(TV-l)e-I^IV ’ 

where D = exp(a/ 3 *— ) —N{N— 1 ) exp( ^°^ ). 

Suppose that X has entries xn,... X22 for the first block 
and y’s for the other 2 x 2 blocks, then we obtain the 
equations 

{xi 2 -b (TV — I)yi 2 ){x 22 + (TV — 1)2/22) = 

{xii -b (TV - l)yii){x2i + (TV - 1)2/21) and 
(a:i2 - yi2){x*22 - 2/22) = 

(a:ii - yii)(a;2i - 2/21) ( 19 ) 

For many such orbits, these equations can be generalized 
easily. After solving these equations, one obtains the 
following optimal average probability of success, 

+ -l)ei+e 2 + {N-l)es 

+ \/l -b (TV - l)ei - 62 - (TV - 1)63 
-b (TV - 1)1/1 - ei -b 62 - 63 

,_ \2 1 

-b (TV - l)-/l - d - 62 + 63J ( 29 ) 

where Ci = exp(—|a/), 62 = exp(—Ija — /?/) and 63 = 
exp(-i(|a|2 -b |/3|2)). 

In order to compare the minimum probability of er¬ 
ror = 1 — pMPE .j-p average error probability 

achievable by standard optical receivers, let us consider 
the case of /3 = —a S K in above. The modulation 
format thus obtained is known as (binary) phase-coded 
PPM, or PCPPM. The best standard optical receiver to 
decode the 2 TV codewords is homodyne detection. Dir¬ 
ect detection can only discriminate between the TV pulse 
positions, but cannot discern the phase. This is because 
the mean photon number in the pulse, |/ 3 / = |a/ = h 
is the same for each phase. Ideal homodyne detection 
















of a coherent state |a) generates a Gaussian distributed 
random variable with mean a and variance There¬ 
fore, homodyne detection of all the pulse positions gener¬ 
ates N statistically-independent real-valued random vari¬ 
ables, of which one random variable X ~ Af{±a, j), and 
TV — 1 i.i.d. random variables ~ JV(0, |). The receiver 
first chooses the pulse position as the one the homodyne 
output corresponding to which has the largest absolnte 
value. Then it chooses the phase based on the sign of 
the real-valued homodyne output for that pulse position. 
The success probability is thus given by, 


= Pr [|X| > maxi<,<^_i {\Z,\}] 



where |X| and \Zi \ have folded normal distributions. We 
evaluated this numerically for N = 8 and plotted the 
error probability 1 — as a function of h, in Fig. 

(see magenta circles). We also calculated and plotted 
the MPE using our results (Eq. (20)) as a function of 
n (see solid red line). Our MPE calculation helps show 
how inferior the performance is (compared with the op¬ 
timal measurement) for a PCPPM modulation for the 
best standard optical receiver choice. 

Now we consider a non-standard, yet intuitive and 
structured optical receiver, to detect the PCPPM code¬ 
words: The coherent-state codeword impinges a photon 
counting receiver. If no click is registered over all the N 
modes (which happens with probability pq = 6“"), the 
receiver chooses randomly between the 2N codewords. 
The first photon click must identify the pulse slot cor¬ 
rectly. The photon arrivals within the time slot contain¬ 
ing the pulse (in state |a) or | — a)) are Poisson dis¬ 
tributed. As soon as the first click arrives—the time of 
arrival of which is random (exponentially distributed)— 
the remainder of the pulse (which is in a coherent state 
1/3) or I — / 3 ) with |/ 3 p < lap) is switched into a Dolinar 
receiver [25] . which identifies the phase correctly with 


an error probability 


1 — Vl — . It is straight¬ 

forward to show that the eventual probability of error 
attained by this structured receiver is given by: 


j^structured 


Po 


'2N- 1' 
2N 


1 -Po 
2 




which is plotted in Fig. (see dashed blue line). A 
numerical evaluation of the error probabilities for the 
MPE and the structured receivers show that, in the high 
photon number limit (A^e"" < 1), 
whereas ~ The SWN receiver can at¬ 

tain the MPE measurement’s error exponent in the high 
h limit for discriminating any N coherent state wave¬ 
forms, and hence applies to PCPPM as well. A simple 
structured receiver to exactly attain at any finite n 

is not known, but our calculation of the optimal measure¬ 
ment for CGU sets allows one to use the general receiver 
concept in Ref. [27], which—despite requiring complic¬ 
ated non-classical operations within the receiver—can in 
principle attain the MPE exactly at all n. 


VI. CONCLUSIONS 


We have developed a new and compact 
interpretation—using group representation theory—of 
the minimum probability of error (MPE) measurement 
for distinguishing a set of geometrically uniform pure 
quantum states—states that form a single orbit under 
the group action, i.e., a transitive action. We also give 
a representation theoretic proof that the pretty good 
measurement, or equivalently the least squares measure¬ 
ment is the optimal (MPE) measurement for a GU set of 
states. More importantly, this representation theoretic 
framework gives explicit formulae for the minimum 
probability of error. This is useful in comparing the 
relative performance of various receivers. Using the same 
framework, we then extended our analysis to construct 
optimal measurements for compound geometrically 
uniform (CGU) state sets, which are states that form 
multiple orbits under the group action. CGU sets appear 
in many practical problems, particularly in transmitting 
classical data over a (quantum) optical channel. All 
linear codes formed using pure-state modulation con¬ 
stellations, which are known to achieve the quantum 
(Holevo) limit to the capacity of optical communication, 
are CGU sets. We showed how to compute the optimal 
measurement for CGU sets by reducing the problem 
to solving a few simultaneous equations. The number 
of equations depends on the sizes of the multiplicity 
spaces of irreducible representations. For many group 
representations (such as those of several practical good 
linear codes), this is a lot more tractable than solving 
large semi-definite programs, in order to solve—by 
brute force—the Yuen-Kennedy-Lax conditions [1] for 
determining optimal measurements for discriminating 
an arbitrary set of pure states with given pairwise 
inner products. We showed one example each of the 
evaluation of optimal measurements for GU and CGU 
states, respectively. 

It is known that coherent-state (laser light) modula¬ 
tion is sufficient to achieve the Holevo capacity, the ul¬ 
timate rate of reliable classical communication over a 
lossy-noisy optical channel [521 [50]. It is also known 
that linear codes (over an underlying coherent-state mod¬ 
ulation) along with optimal measurements—which are 
CGU sets by definition—suffice to attain the Holevo ca¬ 
pacity [5S]. There is however a significant gap between 
the Holevo capacity and the Shannon capacity of the op¬ 
tical channel attainable by conventional optical receivers, 
viz., homodyne, heterodyne, and direct-detection receiv¬ 
ers |31j . and the gap widens in the low photon num¬ 
ber regime [55]. It would be interesting to investigate 
explicit finite blocklength code families with good sym¬ 
metry properties, whose rate performance along with the 
respective optimal measurements—calculated exactly by 
the general method we developed—bridges the aforesaid 
capacity gap. It would also be interesting to develop 
rigorous foundations for translating the optimal CGU 
measurement, to an algorithmic design of structured op- 
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tical receivers built using a small universal set of known 
optical components and ancilla states, that can imple¬ 
ment the optimal measurement on any given linear code. 
In the high photon number regime on the other hand, 
heterodyne detection is known to be asymptotically ca¬ 
pacity optimal. However, in the high photon number 
regime, the improvement attained by the MPE measure¬ 
ment (over conventional optical receivers) in the error 
exponent in discriminating symbols of a modulation con¬ 
stellation (as seen in our GU and CGU examples in this 
paper, and also in Ref. [5^) translates to a superior finite 
blocklength rate achievable by the MPE measurement, 
even though heterodyne detection is capacity-optimal in 


this regime [35]. This suggests that translating our de¬ 
velopment in this paper to an algorithmic design of struc¬ 
tured MPE-attaining optical receivers, may also have a 
benefit in the high photon number transmission regime, 
in terms of the finite-codelength rate performance. 
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APPENDIX 

We recall some basic facts of representation theory 
needed for the results in this paper and then show how 
to obtain the double coset sums in Eq. Given any 
finite group G and a complex vector space V, a linear 
map $ : G —>■ End(U) which takes the group identity 
to the identity endomorphism is called a representation 
of G. Often, the space V is called the representation. 
If there exists a subspace W such that the map $ is 
taken to the subspace for every element G, W is called 
a sub-representation or an invariant subspace of V. The 
orthogonal complement of W in U will also be an invari¬ 
ant space. For any invariant space, the space itself and 
the trivial subspace consisting of the zero vector is always 
an invariant space. If the only invariant spaces of V are 
the trivial space and itself, then V is called an irredu¬ 
cible representation. Every finite group has a finite set 
of irreducible representations (sometimes called irreps) 
associated with it. Any representation can be decom¬ 
posed into irreducible representations, where there may 
be many copies of a given irreducible representation in it. 
Invariant spaces of a group can be related to eigenspaes of 
an operator that is symmetric with respect to that group. 
More precisely, suppose that a matrix A commutes with 
a representation of a group G i.e., U{g)A = AU{g) for 
all g G G. Then the invariant spaces of G lie inside ei- 
genspaces of A. Suppose that U has the following block 
diagonal decomposition 

U (5) = ^ Imx G Hd) I 

A 

where m\ is the multiplicity space of the irrep A i.e., 
the number of times A appears in U. Then A has the 


decomposition given by 

A = ^ Ax G Idx I 

A 

where A\ is a matrix inside the multiplicity space and dx 
is the dimension of A. Note that if, in particular, mx = 1, 
then Ax is one-dimensional and therefore is an eigenvalue 
of A. Even if mx ^ 1 but are small, we only need to 
diagonalize Ax for all A to determine the eigenvalues of 
A. 

Now, we give the details of the calculations used to 
produce Eq. |16| The case for the trivial representation 
is simple. We explain here how to obtain the coset sums 
for the standard representation (denoted b) and the rep¬ 
resentation we denoted as c. We assume familiarity with 
the representation theory of the symmetric group. The 
double coset sum for Go is also trivial to obtain since 
Xa;(Go) (where x represents b or c) is the multiplicity of 
the trivial Gq representation in x. This, by Frobenius re¬ 
ciprocity, is the multiplicity of the representation x in the 
induced representation, which is 1 for both b and c. So, 
now if we evaluate the double coset sum Xx{Gi), then be¬ 
cause both b and c are non-trivial irreducible representa¬ 
tions, Xx{G 2 ) = — 1 —Xa;(Gi). Thus, we only need to eval¬ 
uate one double coset sum Xx{Gi) for b and c. In order to 
do this, note that the double coset sum is actually a mul¬ 
tiple of a coset sum i.e., Xx{Gi) = 2{N — 2)xa;((I3)Go). 
Now in order to evaluate Xa:((13)Go), find the trivial Go 
states in the induced representation. First, we repres¬ 
ent the states in the induced representation as |{*,i}) 
(and there are (2)0! these and they form an orthonor¬ 
mal basis of the induced representation). One can see 
that there are three trivial Go states (this follows from 
Frobenius reciprocity and the fact that this induced rep¬ 
resentation is multiplicity free). The first one is also a 
trivial Sn state and is l{bj}) (denote this by 

|t)). The second one comes from the standard S'jv rep¬ 
resentation (i.e., representation b). Note that the states 
in the representation b are given by 

k.)= E (21) 

Therefore, the (unnormalized) trivial Go state in b can 
be seen to be jsi) -I- [52) (denote this by \u)). Now, we 
can find Xb{Gi) as 

X.(Gi) = 2(7V-2)M^. (22) 

{u\u) 

This turns out to be IV — 4. Therefore, Xb(C’2) = 
— {N — 3). Now, in c, the trivial Go state turns out to be 
(denoted |u)). 

k) = |{l,2})-^|u)-^|t). (23) 

The double coset sum Xc(C'i) can be calculated as follows. 

Xc(Gi) = 2(iV-2)M^. (24) 
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This turns out to be —2. This means that Xc(C'2) = 
1. One can also double check by calculating Xb{C 2 ) and 


Xc(C'2) independently. 

(u|(13)(24)|u) (H(13)(24)|u) 

Xb{C2) = -- ,Xc(C'2) = -y-ry- 

\u\u) \'^P/ 


(25) 



